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Abstract 

We consider the problem of locating a public facility on a tree, where a set of n strategic agents 
report their locations and a mechanism determines, either deterministically or randomly, the location 
of the facility. Game theoretic perspectives of the facility location problem have advanced in two 
main directions, one focusing on the characterization of strategyproof (SP) mechanisms and the other 
quantifying how well various objective functions can be approximated by SP mechanisms. The present 
paper advances the theory in both directions. First, we give a family of randomized SP mechanism for 
any tree network. Quite miraculously, all of the deterministic and randomized SP mechanisms that have 
been previously proposed for various objective functions are special cases of our mechanism. Thus, our 
mechanism unifies much of the existing literature on SP approximation for facility location problems. 
Second, we use our mechanism to prove new bounds on the approximation of the minimum sum of 
squares (miniSOS) objective in tree networks. For randomized mechanisms, we propose a mechanism 
that gives 1 .5-approximation for the miniSOS function on the line, and show that no other randomized 
SP mechanism can provide a better approximation. For tree networks, we propose a mechanism that 
gives 1.83-approximation. This result provides a separation between deterministic and randomized 
mechanisms, as we show that no deterministic mechanism can approximate the miniSOS function 
within a factor better than 2, even for the line. Together, our study establishes a step toward the 
characterization of randomized SP facility location mechanisms on a tree, and provides a fundamental 
understanding of the miniSOS objective function. 
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1 Introduction 



In facility location problems, a social planner has to determine the location of a public facility that needs 
to serve a set of agents. Once the facility is located, each agent incurs some cost that depends on the 
distance from her ideal location to the chosen location of the facility. This class of problems is realized 
in many scenarios, including, for example, locating a server in telecommunication networks or locating 
a library or a fire station in a road network. Facility location problems arise not only in physical settings 
like the ones just described, but also in more virtual settings where the agents' opinions or preferences 
can be represented as their locations, and a single outcome has to be chosen. As an example, consider a 
set of students sitting in a classroom with an air conditioner, where every student has her most preferred 
temperature, and a single temperature has to be chosen. In all of these examples, one "location" has to be 
chosen, and every agent would Uke it to be as near as possible to her most preferred location. 

What is the best method to settle the naturally conflicting preferences of the agents? In other words, 
what is the best way to determine the location of the facility, given the agents' locations? This question, 
and related ones, have been extensively studied in the literature, from various conceptual and algorithmic 
perspectives; see, e.g.. Marsh and Schilling [?], the book by Handler and Mirchandani [?], and the body of 
literature concerning the performance of a Condorcet pointj [?, ?, ?]. Starting with the work of Moulin [?], 
this problem was also studied from a game-theoretic perspective, where the agents are assumed to be 
strategic, i.e., report their locations in a way that will minimize their individual costs. The game theoretic 
aspects of the facility location problem advanced in two main directions, as described below. 

The first direction seeks to characterize strategyproof (SP) mechanisms; i.e., mechanisms that induce 
truthful reporting as a dominant strategy. This is a crucial attribute since in its absence, strategic agents 
may misreport their locations. Moulin [?] and later Schummer and Vohra [?] provided characterizations of 
deterministic SP mechanisms on line, tree, and cycle networks. While the work of Moulin [?] concentrated 
on general single-peake^l preferences [?], Schummer and Vohra [?] considered the special case in which 
the cost incurred by an agent is the length of the shortest path from the facility to her location. These 
results were later extended to various metric spaces (see, e.g.. Border and Jordan [?]). 

The second direction, advocated more recently by Procaccia and Tennenholtz[?], seeks to study the 
approximation ratio that can be obtained by an SP mechanism with respect to a given objective function. 
This agenda, often termed "approximate mechanism design without money", has led to extensive work 
on several domains, including facility location [?, ?, ?, ?, ?, ?], machine learning [?], and matching [?, 
?]. Unlike the traditional motivation for approximation, originating from computational hardness, here, 
approximation is used to achieve strategyproofness. 

The approximation ratio of a mechanism is defined with respect to a given objective function, and 
the standard worst-case notion is being applied. The two objective functions that have been prominently 
featured in the literature are the minisum (i.e., minimizing the sum of agents' costs) and the minimax 
(i.e., minimizing the maximum cost of any agent) functions. While it is well known that the optimal 
location on a tree with respect to the minisum function can be obtained by an SP mechanism (in particular, 
by the median), it has been shown by Procaccia and Tennenholtz [?] that no deterministic (respectively, 
randomized) SP mechanism on a line can achieve a better approximation than 2 (resp., 1.5) with respect 
to the minimax function. Thus, the feasible approximation of SP mechanisms depends on the specified 
objective function. 

In this work, we propose a third objective function — minimizing the sum of squares of distances 
(hereafter miniSOS). The miniSOS function is highly relevant in many economic settings, and is related to 
central notions in other disciplines, such as the centroid in geometry, or the center of mass in physics. Of 
particular interest is the close relation to regression learning, where there is a one-to-one mapping between 

'a Condorcet point is a location that is preferred to any other location by more than half the agents. 

With single-peaked preferences, every agent is associated with an ideal location, considered to be her peak, and the closer 
the facility is to an agent's peak, the most preferred it is. 
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facility location on a line and regression learning (restricted to the class of constant functions). Moreover, 
the miniSOS objective has been given a rigorous foundation by Holzman [?] using an axiomatic approach. 
In particular, Holzman formulated three axiom^ for locating a facility on a line or a tree, regarded as 
sensible requirements, and showed that the unique objective function that satisfies the three axioms is the 
miniSOS objective. Following the above motivation and the axiomatic foundation laid by Holzman, it 
is only natural to study the approximation that can be obtained by SP mechanisms with respect to the 
miniSOS function. 

1.1 Our contribution 

Our contribution spans both the characterization and the approximation agendas with respect to the line 
and tree networks. The line topology is motivated by any one-dimensional decision that is made by aggre- 
gating agents' preferences. The tree topology is particularly motivated by the telecommunication networks 
example, where a tree topology corresponds to any hierarchical network. Notably, in computer networks, 
an agent can easily manipulate its perceived network location by generating a false IP address. 

Characterization The characterization of SP mechanisms for facility location settings has been studied 
thus far mainly with respect to deterministic mechanisms. A randomized mechanism receives a location 
profile and returns a probability distribution over locations. It is well known that randomization can be 
very powerful in social choice settings. In cases where the cost incurred by an agent is her distance from 
the facility (as in the model of Schummer and Vohra [?]), it seems natural to define an agent's cost as her 
expected cost with respect to the given probability distribution. This is the approach taken by Procaccia 
and Tennenholtz [?], and many studies thereafter [?, ?, Quite surprisingly, while there has been a surge 
of research on randomized SP mechanisms for facility location problems, there was very hmited effort on 
characterizing randomized SP mechanism^. 

We make a first step in providing a characterization for randomized SP mechanisms on a tree network. 
In particular, we design a family of randomized mechanisms, termed parameterized boomerang (PB), 
which is SP for any tree network. Quite miraculously, all of the mechanisms that have been devised re- 
cently within the facility location domain (and shown to give tight bounds with respect to various objective 
functions) can be formulated as special cases of mechanism PB. The generality and strength of mechanism 
PB is further illustrated through the analysis of the miniSOS objective, where special cases of mechanism 
PB are shown to achieve good approximation results on line and tree networks with respect to the min- 
iSOS objective. Thus, the characterization of randomized SP mechanisms is not only interesting in itself 
but also equips one with a large family of SP mechanisms, and can be served as a useful tool for studying 
the approximation ratios with respect to various objectives. Mechanism PB is presented in Section [3] 

Approximation We provide approximation results with respect to the miniSOS objective, for line and 
tree networks, and for deterministic and randomized mechanisms. Notably, all the mechanisms that are 
devised in this paper are special cases of mechanism PB. 

^ In particular, Holzman formulated three axioms, which are (roughly speaking): (i) unanimity, stating that if all the agents 
report the same location, then the reported location should be chosen, (ii) Lipschitz, which is a continuity requirement; and (iii) 
invariance, stating that an agent who moves to a location that is equidistant from the outcome from the same direction will not 
affect the chosen outcome. 

■'These works, however, focused on approximating various objective functions by SP mechanisms and not on characterizing 
SP mechanisms. 

^An exception is the work by Ehlers et al. [?], which characterizes randomized SP mechanisms, but is different from our 
work in two respects. First, the characterization of Ehlers et al. applies only to a line network, while we are interested in the more 
general case of a tree. Second, they use a different notion of preferences over probability distribution (in particular, preference 
over probability distributions is defined in terms of first-order stochastic dominance), and, as a result, uses a different notion of 
strategyproofness. This difference has significance effects, for example, on lower bound results for approximation. 
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Line networks (SectionHll. We show that the median gives a 2-approximation deterministic SP mech- 
anism, and that no deterministic SP mechanism can achieve a better approximation ratio. As in other 
studies [?, ?, ?, ?], randomized mechanisms are shown to provide better bounds. In particular, we present 
a randomized SP mechanism that provides a 1.5-approximation; the mechanism chooses the average loca- 
tion with probability ^ and a random dictator with probability ^. In addition, we show that no randomized 
SP mechanism can achieve a better bound. The proof technique used to construct the lower bound involves 
subtle analysis and is perhaps one of the main technical contributions of the paper. Interestingly, while the 
minimax and the miniSOS functions induce different optimal solutions, and different optimal SP mecha- 
nisms, they admit the exact same approximation bounds with respect to both deterministic and randomized 
mechanisms. 

Tree networks (Section |5]l. First, we show that the median gives a 2-approximation with respect to 
the miniSOS objective. This result is tight with respect to deterministic mechanisms, following the lower 
bound established on the line. Our main result in the approximation regime is the construction of a ran- 
domized SP mechanism that gives a 1.83-approximation for any tree network. This result establishes a 
separation between deterministic and randomized mechanisms, as no deterministic mechanism can pro- 
vide a better approximation than 2. 

1.2 Open Problems 

Our study leaves many questions for future research. The first natural challenge is to provide a full char- 
acterization for SP randomized mechanisms on a tree. For the miniSOS objective function, closing the 
approximation gap for the randomized mechanisms on a tree remains open. In addition, it would be in- 
teresting to extend the approximation results for the miniSOS function to other networks topologies, such 
as a cycle and general networks, similar to the studies by Alon et al. [?, ?] with respect to the minimax 
and the minisum functions. An additional direction is to consider a different individual cost function. For 
example, in applications in which agents are more sensitive to distances within the range of high distances, 
a convex cost function seems plausible (a possible example might be the speed of an Internet connection). 
Finally, the three different social functions that have been studied thus far can be considered as special 
cases of the £-norm distance, with minisum, miniSOS, and minimax corresponding to the 1-norm, 2-norm, 
and oo-norm, respectively. It is apparent that while for the minisum function, the optimal location can be 
obtained in an SP mechanism, this is not feasible for either 2- or oo-norms, and the same approximation 
bounds apply in both cases. Generalizing this result to any £-norm is an additional stimulating direction. 

2 Model and Preliminaries 

We use the model of Schummer and Vohra [?], where the network is represented by a graph G, formalized 
as follows. The graph is a closed, connected subset of Euclidean space G C R'^. The graph is composed 
of a finite number of closed curves of finite length, known as the edge^ The extremities of the curves are 
known as vertices (or nodes). An important class of graphs, which is the focus of this paper, is tree graphs 
— graphs that contain no cycles. 

The path between two points a, 6 G G is denoted by pathcia, b). The distance between two points 
a,b ^ G, denoted dcia, b), is the length of the (unique) path between a and b. We extend the definition of 
distance between points to distance between a point and a path as follows. Given a point c G G and a path 
pathcia, b), the distance between c and path{a, b), denoted dcipathGia, b), c), is the shortest distance 
between c and any point on path{a, b); i.e., dcipathcia, b), c) = 'rninifzpatha{a,b)dG{h c)- When clear in 
context, we omit the subscript G. 

* Note that while this model is expanding upon the notion of an interval, it is not analyzing full-dimensional, convex subsets of 
Euclidean space. Rather, travel is restricted to a road network, where convex combinations of locations are typically not feasible. 
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Let N = {1, . . . , n} be a set of agents. We sometime use [n] to denote the set of agents N. Each 
agent i ^ N has an (ideal) location Xi £ G (agents can be located anywhere on G). The collection 
X = (xi, . . . ,Xn) € G" is referred to as the location profile. 

A deterministic mechanism is a function / : — )■ G that maps the agents' reported locations to the 
location of n facility (which can be located anywhere on G). If the facility is located at y G G, the cost of 
agent i is the distance between xi and y; i.e., cost{y, Xi) = d{y, Xi). 

A randomized mechanism is a function / : G" — )• A(G), which maps location profiles to probability 
distributions over G (which randomly designate the facility location). Let P € A(G) be a probability 
distribution over G. If f{x) = P, then the cost of agent i is the expected distance of the facility location 
from Xi; i.e., cost{P, Xi) = Ey^p[cost{y, Xi)]. When clear in the context, we write y ~ /(x) for ease of 
presentation. 

A mechanism is called strategyproof (SP), or truthful, if no agent can benefit from misreporting her 
location, regardless of the reports of the other agents. Formally, in our scenario, this means that for all 
X G G", for all i G N, and for all x'^ € G, it holds that cost(/(x), Xj) < cost{f{x'-,x-i),Xi), where 
X-i = (xi, . . . , Xj_i, Xj+i, . . . , Xn) is the profile of all locations, excluding agent i's location. 

The quality of a facility location is usually evaluated with respect to some target social function. Given 
a location profile x = (xi, . . . , x^) and a facility location y, the social cost of y with respect to x is given 
by a function sc{y, x). The social cost of a distribution P with respect to x is sc{P, x) = Ey^p[sc{y, x)]. 

Given a social cost function, location y € G is said to be optimal with respect to a profile x if 
sc(y,x) = miny/^asc{y' j'k). An optimal location is denoted by Opt{G,'K). When clear in the con- 
text, we simply write Opt. In addition, we often abuse notation and use Opt to refer to the social cost of 
an optimal location. 

A mechanism / is said to provide a-approximation with respect to a social cost function so if for every 
graph G and every location profile x, sc{f{:K.),:x.)/sc{Opt, x) < a; that is, the mechanism always returns 
a solution that is an a factor of the optimal solution. 

In this paper we are interested in optimizing the sum of squared distances (SOS) function; that is, 
sc(y,x) = J2ieN ^(y T ^i)"^ ■ "^^^ objective function is extremely important from both normative and 
positive perspectives, as discussed in the introduction. 

Given a profile x, the median of x in a tree G, denoted by ju(G, x), is defined as follows. We start from 
an arbitrary node (induced by G) as a root. Then, as long as the current location has a subtree that contains 
more than half of the agents, we smoothly move down this subtree. Finally, when we reach a point where 
it is not possible to move closer to more than half the agents by continuing downwards, we stop and return 
the current location. 

We continue with several graph theoretic definitions and lemmas. At this point, it is necessary to 
emphasis the difference between a location profile, which was defined earlier and is tightly coupled with a 
set of agents, and a location vector, which is a set of locations in the graph. 

Definition 1 Given a tree G and a point x G G, let T{G,x) be the set of subtrees defined as follows. If x 
is a tree node (with degree d^), then T(G, x) = {Ti, . . . , where Ti is the subtree of descendant i 

rooted at x. If x is not a node (i.e., it is a point on an edge), then T{G, x) = {Ti, T2}, where Ti and T2 
are the respective left and right subtrees rooted at x. 

Definition 2 Let G be a tree, y E G*" be a location vector, and w be a probability vector of size 
m. The weighted average location with respect to G, y and w, denoted wAvg{G, y, w), is a point 
in G which minimizes the weighted sum of squared distances from the locations in y; i.e., wAvg G 

The following lemmas will be required in the sequel. Their proofs are deferred to Appendix [A] 
Lemma 3 Let y € G™ be a location vector, and w a probability vector of size m. It holds that a = 
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wAvg{G, y, w) if and only if for every Tj G T(G, a), 

Wid{yi,a)< ^ Wid{yi,a). 

i€lm]:yi<=:Tj ielm\:yi<^Tj 

Corollary 4 Le? y € G"* fte a location vector, and w a probability vector of size m. The weighted average 
location with respect to G, y and w is unique. 

Lemma 5 Let y, y' € G"^ be location vectors, and w be a probability vector of size m. For every 
i € [m], let 6i = d{yi, y[) . Let a = wAvg{G, y, w) and a' = wAvg{G, y', w). It holds that d{a, a') < 

Lemma 6 Let x G G" be a location profile. Let a,b G G be two locations, and let Th G T{G, a) and 
Ta G T(G, h) such that they contain path{a, b), and assume that all of the agents are either in G\Ta, in 
G\Tb, or on path{a, b). Additionally, assume that Opt is located in G\Ta. Then, sc{a, x) — sc{b, x) = 

-\N\d{a, 6)2 - 2d{a, b) (EigG\T6 d{xi, a) - Y^ian ")) 

Corollary 7 Let x G G" be a location profile, and let a,b G G as described in Lemma^ Let x' be 
constructed as follows. Let xj G G\Ta be the most far agent from b in G\Ta, and for each i such 
that Xi G G\Ta, locate x[ on path{b,Xj) such that d{b,x[) = d{b,Xi). Then, sc(a, x) — sc(6,x) = 
-\N\dia, bf + 2|iV|d(o, b)d{a, Opt') 



3 Randomized SP Mechanisms on a Tree 

In this section we introduce a family of randomized SP mechanism for locating a facility on a tree. Unless 
otherwise stated, the graph G in this section is assumed to be a tree. 

The following notion of a boomerang mechanism is a key concept in our construction. 

Definition 8 A deterministic mechanism f is said to be a boomerang mechanism if for every location 
profile X, agent i, and point x[, cost{f{x.'), Xi) — cost{f{x),Xi) = d(/(x'), /(x)), where x' = (x^, 

That is, a boomerang mechanism is one in which a deviating agent fully absorbs the effect of her deviation 
on the facility location. Clearly, every boomerang mechanism is SP. 

Several examples of boomerang mechanisms follow: (The proof is left to the reader.) (i) dictatorship; 
i.e., where there exists i ^ N such that for every x, /(x) = Xj. (ii) median (on a tree), (iii) k'th-location 
(on a line); also known as generalized median [?]. 

We are now ready to introduce the family of randomized SP mechanisms for tree networks. This family 
is presented as a parameterized mechanism, called "parameterized boomerang". 

Mechanism parameterized boomerang PB: Let f = (/i, . . . , fm) be a collection of boomerang 
mechanisms. For every i G [m], let yi = /i(x), and let y = (t/i, . . . , y^)- Let w be a probability distribu- 
tion supported on m elements, and let a = 'wAvg{G,y,w). The facility location is chosen according to 
the following probability distribution: 

• for every i G [m], choose /i(x) with probability ^Wi. 

• choose a with probability i. 

We refer to the two components of the probability distribution as the boomerang component and average 
component, respectively. Note that every boomerang mechanism is a special case of PB, with m = 1. 

The following theorem, whose proof is deferred to Appendix El establishes the strategyproofness of 
Mechanism PB. It is followed by an immediate corollary. 



5 



Theorem 9 Mechanism PB is SP. 

Corollary 10 Every fixed probability distribution over PB mechanisms is SP. 

In recent years, various SP mechanisms have been proposed in the literature for the facility location 
problem on the line with the objective of approximating different social objectives, such as the minisum 
and the minimax functions. The following proposition shows that all of the mechanisms that have been 
proposed in this context are special cases of Mechanism PB (or a probability distribution over PB mecha- 
nisms). The proof is deferred to Appendix IbI 

Proposition 11 The following mechanisms on the line are special cases of Mechanism PB(or a probabil- 
ity distribution over PB mechanisms). 

1. A;-location. Examples of this mechanism are the median mechanism, which is known to minimize 
the sum of distances, and the leftmost agent mechanism, which provides a 2-approximation for the 
minimax objective [1] (which is tight with respect to deterministic mechanisms). 

2. left-right-middle (LRM) [1 ]. LRM chooses the leftmost agent with probability \, the rightmost agent 
with probability \, and their middle point with probability ^. It provides a (tight) \. ^-approximation 
for the minimax objective. 

3. random dictator (RD). RD chooses every agent with probability K It provides a 2 — ~approximation 
for the minisum objective [7 ]. We will later establish that RD gives a 2-approximation for the 
miniSOS objective (see Theorem\16\l. 

While it is evident from the last proposition that Mechanism PB is very powerful, it imposes a sufficient 
condition for strategyproofness, but not a necessary one. This is established by Example [24l that could be 
found in Appendix |Bj 



4 SP Mechanisms on a Line 

In this section, we study how well SP mechanisms can approximate the miniSOS objective — minimizing 
the sum of squared distanced — on a line. In the deterministic case, we present a mechanism that provides 
2-approximation, and show that no SP deterministic mechanism can achieve a better ratio. In the random- 
ized case, we construct a mechanism that provides 1.5-approximation, and show that this result is tight. 
AU missing proofs in this section are deferred to Appendix ICl 

In this section, the graph is essentially the real line, M. It is easy to verify that an optimal location in 
this case is simply the average. 

Claim 12 Given a location profile x, the optimal facility location with respect to the miniSOS objective is 
the average location; i.e.. Opt = argminysc{y, x) = — 

The following lemma proves extremely useful in establishing the lower bounds throughout this sec- 
tion. In particular, it helps us relate joint deviations (i.e., coordinated deviations by a subset of agents) to 
unilateral deviations (i.e., deviations by a single agent). 

Lemma 13 Let a,b,c G Mbe three locations such that a < b < c, with at least one strict inequality, and 
for every m £ [n], let (respectively, x^) be a location profile in which n — m agents are located at a, 
and m agents are located at c (resp., b). Let f be a randomized mechanism. If f is an SP mechanism, then 
E[\c-y^\] < E[\c-y"'\] and E[\b - y""]] < E[\b - y'^]], where y^ ~ /(x") andy"^ ~ /(x™). 

4.1 Deterministic mechanisms 

Theorem 14 Given a location profile x, the mechanism that chooses the median location in x is an SP 
2-approximation mechanism for the miniSOS objective. 
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Notably, this mechanism is a special case of mechanism PB. 

The following theorem shows that factor 2 is tight with respect to deterministic SP mechanisms. 

Theorem 15 Any deterministic truthful mechanism has an approximation ratio of at least 2 for the min- 
iSOS objective. 

4.2 Randomized mechanisms 

A natural candidate of a randomized mechanism to be considered in our context is the random dictator 
(RD) mechanism, which chooses each agent's location with probability -. This mechanism is SP and is 
known to provide a (2 — -) -approximation with respect to the minisum objective function (See [?] and 
[?]). The following theorem shows that the RD mechanism provides a 2-approximation for the miniSOS 
objective. More precisely, for every location profile, the RD mechanism yields an SOS cost that is exactly 
twice the cost of the optimal location. This is established in the following theorem. 

Theorem 16 For every location profile, the RD mechanism yields an SOS cost that is exactly twice the 
optimal SOS cost 

As shown in Proposition [TH the RD mechanism is a probability distribution over PB mechanisms. 

Apparently, the RD mechanism does not perform better the deterministic median mechanism. Yet, 
this mechanism turns out to be useful when integrated within a more sophisticated mechanism, as shown 
below. 

Mechanism 1 Given x G i?", choose the average point with probability ^, and apply the RD mechanism 
with probability ^ (i.e., for every i N, Xi is chosen with probability ^J. 

Theorem 17 Mechanism\l}is an SP 1.5-approximation for the SOS objective. 

Notably, while approximation in its usual sense looks at the worst-case ratio between the expected cost 
of the mechanisms solution and the cost of the optimal solution, in this case the 1.5-approximation applies 
not only in the worst-case notion; rather, this is the exact approximation achieved for every location profile. 

Surprisingly, Mechanism [T] provides the best possible approximation; that is, no SP mechanism, ran- 
domized or not, can achieve a better approximation ratio than 1.5. This bound is established in the next 
theorem. Due to lack of space, we give an overview of the proof, and defer the full proof to Appendix O 

Theorem 18 Any randomized SP mechanism has an approximation ratio of at least 1.5 for the miniSOS 
objective. 

Proof Sketch. Assume on the contrary that there exist an SP mechanism /(x) and e > such that /(x) 
yields an approximation ratio of 1.5 — e, and let y be a random variable that is distributed according to f{x). 
Starting at the original profile, we construct a series of profiles, on which we repeatedly apply Lemma [T3l 
Using this technique, we prove that there exist some a € M and a location profile x in which | agents are 
located at a and | agents are located at a + 4, such that E[\y — (a + 1)| + |y — (a + 3)|] > 3 — 2e. We 
then use the last inequality to show that E[sc{y,x.)] > 6n — 4ne. By observing that the optimal facility 

location for this profile is at a + 2 (which yields an SOS cost of An), we get ^^""opi"^^^ > ^"4n"' = 1-5 - e, 
and a contradiction is reached. ■ 

5 SP Mechanisms on a Tree 

In this section, we study the miniSOS objective with respect to locating the facility on a tree. In the de- 
terministic case, we show that the median of a tree provides a 2-approximation for the miniSOS objective, 
and show that no SP deterministic mechanism can achieve a better ratio with respect to miniSOS. In the 
randomized case, we construct an instance of Mechanism PB, which obtains a 1.83-approximation. 
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Figure 1: An illustration of the iterative process described in the proof of Theorem [19] ^ is illustrated 
on the left, and is illustrated on the right, where the dashed line represents the new edge. 

5.1 Deterministic mechanisms 

It is well known that the mechanism that chooses a median of a tree is SP (see, e.g., [?]). The following 
theorem establishes that a median also gives a 2-approximation for the miniSOS objective, which is tight, 
according to Theorem [151 The theorem is followed by a sketch of the proof. The full proof is deferred to 
Appendix iDl 

Theorem 19 The median of a tree is an SP 2-approximation mechanism for the miniSOS objective. 

Proof Sketch. Given an instance (G,x), we iteratively transform it, in a way that can only make the 
approximation ratio obtained by the median worse, and eventually prove the desired approximation ratio 
on the final instance. Let and Opt denote the respective median and optimal location in the original 
profile. The iterative process proceeds as follows. As long as there exists an agent j such that xj's subtree 
is rooted at the open interval path{iJ,, Opt), pick such an agent j, create a new edge of length d{xj, Opt), 
rooted at Opt, and locate Xj at its tip (see illustration in Figure [T]). It can be proved that the median and the 
optimal location did not change as a result of this transformation, and that the optimal cost did not change 
either. The cost of the median, however, can only increase. Therefore, the approximation ratio can only 
get worse by each transformation. Upon termination of this process, we prove the desired approximation 
ratio on the final instance by reducing it to the deterministic scenario on a line and applying Theorem [141 



5.2 Randomized mechanisms 

In this section we present an instance of mechanism PB, which obtains 1.83-approximation for trees, with 
respect to the miniSOS objective. Our randomized mechanism uses the following deterministic mechanism 
as a building block. It is followed by a proposition, whose proof is deferred to Appendix iDl 

Mechanism dictatorial-generalized-median (DGM): Mechanism DGM receives as pa- 
rameters an index i G [n], and a fraction q G (1/2, 1]. The facility location is chosen deterministically, 
with respect to i and q, as follows. Fix the point X j as the root of the tree, and denote the current location 
a. Then, as long as there exists a subtree in T{G, a) that contains at least fraction q of the agents (this is 
well defined, since q > 1/2), smoothly move down this subtree. Finally, when we reach a point where it 
is not possible to move closer to at least fraction q of the agents by continuing downwards, we stop and 
return the current location. 

Proposition 20 Mechanism DGM is a boomerang mechanism. 

Note that mechanism DGM is not a boomerang mechanism for g < 1/2, as a beneficial misreport can 
exist, depending on the tie-breaking rule. 

With this we are ready to introduce our randomized mechanism. 
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Mechanism randomized DGM: Mechanism randomized DGM receives as a parameter a fraction 
q G (1/2, 2/3], and applies mechanism PB with the following parameters: m = n, Wi = I / n for every 
i G [n], and for every i G [n], /j(x) is mechanism DGM with parameters i and q. 

Interestingly, the following property holds. Its proof is deferred to the appendix. 

Lemma 21 For every tree G, the points fi (x) , . . . , fn (x), calculated by Mechanism randomized DGM, 
are located on a single path. 

In the remainder of this section, we shall use the notation suggested in Lemma 1271 i.e., for every 
i G [n], let yi = fi{'x.). This is followed by another notation - avg{y), which represents the average 
location on path{yi,yn), given the locations in y. 

The main result of this section establishes that Mechanism randomized DGIVI, with q = 2/3, obtains 
an approximation ratio of 1.83 for trees. 

Theorem 22 Let G and x be a tree and a location profile, respectively. Mechanism randomized DGM 
with q = 2/2> obtains an approximation ratio o/1.83 with respect to the miniSOS objective. 

Before presenting the proof sketch of the theorem, we observe that it can be assumed w.l.o.g that 
2/1 7^ Un, as the following lemma establishes. 

Lemma 23 Ifyi =1/2 = • • • = 2/n. then the approximation ratio obtained by Mechanism randomized 
DGM is at most 1.5. 

We are now ready to present an overview of the proof of Theorem l22l 

Proof Sketch. The proof of this theorem requires several lemmata, which correspond to various trans- 
formations that are needed in order to prove the desired approximation ratio. We shall describe the proof 
schematically along with a graphical illustration, but defer the formal statements of the lemmata and their 
proofs to Appendix iDl Following Lemma [23l we assume that yi ^ yn- Additionally, we assume, without 
loss of generality, that d{yi,yn) = 1, and scale the graph accordingly. We let Opt denote the closest 
location on path{yi,yn) to Opt, i.e.. Opt = mini^pai^y^ y^-^d{Opt, I). For ease of presentation, we refer 
to Mechanism randomized DGIVI with q = 2/3 as "the mechanism". 

The proof is established as follows. Given a graph G and a location profile x, we run them through 
various transformations, and show that the approximation ratio obtained by the mechanism could only get 
worse in every transformation. Eventually, we prove that the mechanism provides a 1.83-approximation 
ratio on the final graph and location profile, which implies an upper bound of 1.83 on the original instance. 

More specifically, given an instance (G, x), we proceed as follows: 

• We transform the graph repeatedly according to Lemma [22l which results in a graph in which all of 
the agents that are not located on path{yi,yn) are rooted at either yi, y„, or Opt (see Figure 2). 

• Assuming that avg{y) is on path{yi,Opt) (the proof follows analogously if avg{y) is on 
path{Opt, yn)), we then transform the graph according to Lemma [28] and Lemma [29] such that all 
the agents rooted at yi (i.e., all agents i such that yi = yi) are located at yi, and all the agents rooted 
at yn (i.e., all agents i such that yi = yn) are located on their own new edges, rooted at y„, and 
equally distanced from it (see Figures 3 and 4). 

At this point, the transformed graph can be shown to have the following properties: At least |A^|/3 agents 
are located at yi, at least | A^|/3 agents are located on new edges, rooted at y„ and equally distanced from it; 
some agents are scattered along path{yi,yn)', and the rest are located on edges that are rooted at the same 
location on path{yi,yn). Denote this location by p. The location p is essentially the original location of 
Opt (note that after transforming the graph using Lemma l28l Opt might be relocated). We now distinguish 
between several cases, depending on p's location: 

• If p is located at y„, then Opt could be located either at y„ as well, or in a subtree rooted at p. In 
the former case, a 1^ -approximation is obtained, by Lemma[32l In the latter case, we transform the 
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graph using Lemma [30] (see Figure 5), and by Lemma [33] we achieve an approximation ratio of 1.83. 

• If p is located on the open interval path{yi,yn), we transform the graph using Corollary ]3T] (see 
Figure 5), and again, by Lemma[33l we achieve an approximation ratio of 1.83. 

By observing that p cannot be located at yi, we conclude that these two cases exhaust all the possibilities 
for p's location. In conclusion, since all the transformations are shown to only worsen the approximation 
ratio, the 1.83-approximation that is obtained for the final instance, imposes the same upper bound on the 
original one, and the assertion of the theorem follows. 




Figure 2: Lemma ]27Ts transformation assures that all the agents that are not located on path{yi,yn) are 
rooted at either yi, or Opt. 




Figure 3: Lemma 1281s transformation assures that all agents that are rooted at yi are in fact located at yi. 




Figure 5: Lemma [30] and Corollary [3T] average the distance of the agents that are rooted at Opt, while 
locating them on the same subtree. 
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APPENDIX 



A Missing proofs in Section |2] 

Proof, of LemmaO Suppose by way of contradiction that there exists Tj G T(G, a), such that 

^ Wid{yi,a)> ^ Wid{yi,a). (1) 

Let h be the distance between a and the closest vertex or location in y in the subtree Tj (such that d{a, h) > 
0). Let h{y) be a function /i : [0, 6) — )• M that yields the weighted sum of squared distances for the location 
that is distanced y from a on the subtree Tj . It holds that 

h'{0+)= Yl Wi2d{yi,a)- ^ Wi2d{yi,a) < 0, 

where the last inequality follows by Equation ([T]l- It follows that there exists a location for which the 
weighted SOS distances is lower than in a; hence we reach a contradiction. 

Sufficiency: Assume that for every Tj G T{G, a), it holds that 

Wid{yi,a)< ^ Wid{yi,a), 

ie[m\:yi£Tj ie[m]:yi<^Tj 

and suppose by way of contradiction that a is not the weighted average location. Let 6 be a weighted 
average location. Let T;, € T(G, a) be the subtree such that h S T5. Let Sa G T{G, b) be the subtree such 
that a £ Sa- 

Since T{G, b)\Sa C Tf, , it holds that 

Wid{yi,b) < y^ Wid{yi,a) < Wid{yi,a). (2) 

ie[rn]:yi^Sa i&[m]:yifSa ie[m]:{/iGT(, 

In a similar manner, since r(G, a)\T5 C S'a, it holds that 

Wid{yi,a) < ^ Wid{yi,b) < Wid{yi,b). (3) 

ieH:3/i^rj, ieM:j/i^r6 ieHii/ieSa 

Following the fact that 

Wid{yi,a) < y^ Wid{yi,a), 

ie[Tn]:yi&Tf, ie[m]:yii^Tt, 

and using Equations ([21) and (O, it follows that 

y^ Wid{yi,b) < y^ Wid{yi,b), 

i€[m]:yi(fSa i&[m]:yieSa 

which contradicts necessity; the assertion follows. ■ 

Proof, of Corollary IH Assume by way of contradiction that there exist two different weighted average 
locations, denoted by a and b. Following Lemma[3l for every Tj € T{G, a), it holds that 

y^ Wid{yi,a)< y^ Wid{yi,a). 

ig[m]:{/ieTj ie[m]:yi^Tj 
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Following the same notation and reasoning as in the proof of Lemma[3] (sufficiency), it follows that 

^ Wid{yi,b) < ^ Wid{yi,b), 

which, according to Lemma[3l implies that h is not a weighted average location. The assertion follows. ■ 

Proof, of Lemma|5} Assume on the contrary that d{a, a') > J2i(^[m] '^i^i- Let Tq € T{G, a) be the subtree 
such that a' G Tq, and let Sq G T{G, a') be the subtree such that a G Sq. By Lemma[3]we know that 

0< ^ Wid{yi,a)- ^ Wid{yi,a). 

i€[m]:yi(^To ie[in]:yieTo 

It follows that 

-^Wi6i< ^ Wid{yi,a)- ^ Wid{y[,a). (4) 

We note that T(G, a)\To C 5o and that T{G, a')\So C Tq. The next equations follow. 

^ y;id(y-,a')= ^ a) + d(a, a')) + ^ Wid{y[,a'), (5) 

^ w;id(y-,a)= ^ ujj a') + d(«> «')) + XI Wid{y[,a). (6) 
Considering that 

X Wj(i(y-,o')+ X Ti;id(y-,a)> ^ T(;i(i(a', a), 

jeH:?/iG7on5o ieH:i/ieron5o i(^[m]:y[(iTor\So 

and combining Equations ([5]) and Q, it follows that 

X ti;id(y-,a') - ^ Wid{yla') > 

ie[m]:y'^eSo ielm]:y'.^So 

X Wid{y[,a)- X u;id(y-,o) + X ti;id(a,a'). (7) 

Since (i(a, a') > X^jg^^j it follows from Equations @ and ([T]) that 

X u;i(i(y-,a') - ^ a') > 0. 

It then follows by Lemma[3]that a' cannot be the weighted average location with respect to G, y' and w^; 
hence we reach a contradiction. ■ 

Proof, of Lemma[6]For ease of presentation, we denote 6 = d{a, b). When taking sc(a, x) as a basis, the 
increase of cost when examining sc(6, x) is as follows 

X {2d{xi,a)5 + + ^(2d(xi,a),5 - 5'^) = 
\N\5'^ + 26{ X d{xi,a) -^d{xi,a)) 
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The assertion follows. 



Proof, of Corollary |7] It is easy to see that sc(a,x) = sc(a,x'), that sc(5, x) = sc(6, x'), and that 
T.ieG\ndixi,a)-'£.^j.^d{xi,a) = Y.ieG\nd{x'^,a)-Y,^^^^d{x[,a). It is left to show that XliecVT^ ^(^'i' « 

It is clear that there are exactly two subtrees in T{G, Opt') which contain agents (given x'). We denote 
them by Topti, which contain a, and ropi2- FoUowedby Lemma [3l it is easy to see that ^ G Toptid{x[, Opt' 
Y^x'^e Topt2d{x'^, Opt'). It follows that 

^ {d{x'i,a) + d{a,Opt'))+ Yl {d{a,Opt')-d{x'i,a))= ^ {d{x'i,a) - d{a,Opt')) ^ 

ieG\n i&nr\Topti ieTopt2 

^ d{x'i,a) - ^ d{x'i,a) = -\N\d{a,Opt'). 
The assertion follows. ■ 



B Missing proofs in Section |3] 

Proof, of Theorem |9j Assume by way of contradiction that there exists an agent i that can benefit by 
misreporting her location as a;-, inducing a location profile x' = (x-, x_i). We quantify the effect of the 
deviation on the boomerang component and the average component of Mechanism PB. We begin with the 
boomerang component. For every j G [m], let 5j = d{fj{'x'),Xi) — d{fj{x.),Xi) be the additional cost 
incurred by i due to the deviation, when fj is chosen. Since fj is a boomerang mechanism, it holds that 
d{fj{x.'),Xi) — d{fj{x.),Xi) = d(/j(x'), /j (x)) > 0. Therefore, the additional cost incurred by i due 
to the boomerang component is X^^gj^] Wj6j = J2j£[m] ''^j^ifji'^.'), /j(x)). By LemmaO the average 
component reduces agent i's cost by at most J2j£[m] ''^jdifji^'), fj{^))- The assertion of the theorem 
follows. ■ 

Proof, of Proposition [TT] In what follows we establish the correctness of the assertions of the proposition. 

1. Any mechanism that chooses the fc-location, is an instance of Mechanism PB, as it is a boomerang 
mechanism. 

2. The left-right-middle (LRM) mechanism is an instance of Mechanism PB, with m = 2, w = 
(1/2, 1/2), and /i and /2 being the left-most and right-most mechanisms, respectively (which are 
boomerang mechanisms). The weighted average location is the center of the left-most and right- 
most locations, and is chosen with probability 1/2, while each of the extreme points is chosen with 
probability 1/4. 

3. The random dictator (RD) mechanism is the uniform distribution over dictator mechanisms, which 
are boomerang mechanisms. The assertion follows by Corollary [TOl 



Example 24 Consider the following mechanism on a line. Choose each of the leftmost and rightmost 
agents with probability and the center of every two consecutive locations with probability ^. It is easy 
to verify that this is an SP mechanism, yet it cannot be formulated as a special case of Mechanism PB or 
a probability distribution over PB mechanisms. 
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C Missing proofs in Section |4] 

Proof, of Claim[T2l The optimal location can be derived by taking the derivative of the SOS cost function. 
As the derivative of X^jgjy Iv ~ ^ieN '^V ~ the point that minimizes the SOS cost is ^'^^^ ^' . 



Proof, of Lemma[T3l In the profile x'^, m agents are located at c. For j = 0, . . . , m, let x-' be the profile 
in which n — m agents are located at a, j agents are located at b, and m — j agents are located at c; and 
let ~ /(x-'). Since / is SP, it must hold that E[\c — y^\] < E[\c — y^\]; otherwise, in the profile x°, 
every agent can decrease her cost by misreporting her location to be at b instead of c. Similarly, for every 
j = 2,...,m, itholdsthat£;[|c-y''"^l] < E[\c - y^]. It follows that £;[[c - < E[\c - y'^\]. The 
same line of arguments is used to prove that E[\b — < E[\b — y^\], which concludes the proof. ■ 

Proof, of Theorem [14] Let be a median location in x. It is well known that the median mechanism on 
a line is truthful (see, e.g., [?]). Therefore, it remains to show that it provides a 2-approximation to the 
miniSOS objective; formally, we need to show that 

J2\xi- <2j2\Opt-x^\\ (8) 

ieN ieN 

Assume without loss of generality that xi < X2 < ■ ■ ■ < x„. Assume additionally that /i < Opt (the proof 
works analogously for the case in which fi > Opt). It is easy to verify that 



^ \X^ - = ^ \{x, - Opt) + {Opt - 

ieN ieN 

= ((x. - Opt)' + 2{0pt - - ^^)) . 
ieN ^ ^ 

Therefore, by subtracting J2ieN \Opt — a^ip from both sides, it remains to prove that 

2{0pt - - < \Opt - 

ieN ieN 

which is equivalent to showing that 

E 2(0„<-,)('(.,-^) + (.W-<-^))< 



i<\n/2] ^ ' 

\Opt - Xif ^\Opt - Xn+\-if . 

i<rn/2] 

We next show that the last inequality holds piecewise; i.e., for every i < [n/2] , it holds that 

2{Opt - /.) ({X, - 9pL±J^) + _ ^M_L^)^ < (Opt _ ^^)2 + (Opt - Xn+l-if. 



For every i < [n/2], it holds that xi < ji < Opt; thus {Opt — /x)^ < {Opt — Xi)', and it suffices to show 
that 

2{0pt -fi)(fi- ^^^^ + - ^^^)^ < {Opt - + {Opt - x„+i_,)2. 



It can be easily verified that the above inequality holds iff {20pt — /x — Xn+i-i)'' > 0; the assertion of the 
theorem follows. ■ 
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Proof, of Theorem [T5l Assume by way of contradiction that there exists a deterministic SP mechanism / 
which yields a better approximation than 2. Consider a location profile x, in which ^ agents are located 
at 0, and ^ agents are located at 2, and let /(x) = p. Simple calculations show that to achieve a better 
approximation than 2, it must hold that p G (0, 2) (note that the optimal location is 1, which obtains an 
SOS cost of n, while the locations or 2 obtains each an SOS cost of 2n). Now consider a different 
location profile, denoted x^, in which ^ agents are located at 0, and ^ agents are located at p. Following 
the same argument, it must hold that /(x^') € (0,p), and thus \p — /(x*')| > 0. Since /(x) = p, we get 
that \p — f{x^^) \ > \p — /(x) |, which implies, by Lemma [T3l that / is not SP ■ 



Proof, of Theorem [T6l Let rd(x) denote the RD mechanism. It holds that 

sc{rd{x.),x.) = ^ — ^ {xi — Xj)^ (by def. of sc) 



n 



2 



0) 



+ ^ X^ X] ^^^^ 

i£N jeN jeNieN 

ieN jeN ieN ^ ^ 

2^x\- 20pt ^ {2xi - Opt) (by Observation [E]) 

2j2iOpt-x^f 

2sc{Opt, x) (by def. of sc). 



Proof, of Theorem [T71 We first prove the approximation factor. Let g'(x) denote mechanism [T] and let 
avgi'x.) denote the average point. By Observation [121 the optimal location with respect to miniSOS is 
avg{-x). We get 

gc(g(x),x) ^ ^RDjx) + ^avg{x) ^ ^ ^ 
Opt avg{x) ' 

where the last equation follows by Theorem [T6l 

In order to show the strategyproofness of the mechanism, it suffices to prove that it is an instance of 
Mechanism PB. Indeed, one can easily verify that this is a special case in which m = n, w is the uniform 
distribution over [n], and for every i G [n], /i(x) = Xj (i.e., /j is dictatorship with agent i as the dictator). 



Proof, of Theorem [TSl Assume on the contrary that there exists an SP mechanism /(x) and e > such 
that /(x) yields an approximation ratio of 1.5 — e, and let y ~ f{x) (i.e., y is a random variable that is 
distributed according to f{x)). We shall use the following lemma in the proof. 

Lemma 25 There exists some a G M and a location profile x in which ^ agents are located at a and ^ 
agents are located at a + 4, such that E[\y — (a + 1) \ + \y — {a + 3)\] > 3 — 2e. 

Proof. Consider a location profile x'^, in which ^ agents are located at and ^ agents are located at 4, and 
letyO ~ /(xO). If S[|y0-3| + |y°-l|] > 3 -2e, then we are done. Otherwise, either - 1|] < 1.5-e 
or E[\y^ — 3|] < 1.5 — e. Assume w.l.o.g. that the latter holds, and consider a location profile x°, in which 
^ agents are located at and the rest are located at 3. Let y^ /(^'^)- By Lemma [T3l to preserve 
truthfulness, it must hold that E[\y^ - 3\] < 1.5-e, which implies that E[\y^ - 0\] > 1.5 + e. 
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Consider next a location profile x^, in which ^ agents are located at —1 and the rest are located at 3, 
and let ~ /(x^)- By Lemma [T3l to preserve truthfulness, it must hold that E[\y^ — 0|] > 1.5 + e. If 
E[\y'^ - 2| + \y^ - 0|] > 3 - 2e, then we are done. Otherwise, it follows that E[\y^ - 2|] < 1.5 - 3e. 

We continue iterating such that in iteration j = 1, 2, . . ., a profile x.^ is considered, in which half the 
agents are located at — j and half are located at 4 — j, and for every profile x-' , we denote by y^ the random 
variable distributed according to /(x-^). We show that there exists some j for which E[\y^ — (1 — j)| + 
\y^ — (3 — > 3 — 2e; the assertion of the lemma then follows by substituting a = —j. It remains to 
prove the last inequality. Indeed, by repeatedly applying Lemma[T3]for every j, we get that 

if E[\y^ - (1 + \y^ - (3 - j)|] < 3 - 2e, then E[\y^ - {3 - < 1.5 - e{2j + 1). (9) 

But since fori > it holds that 1.5-e(2j+l) < 0,itmustholdthatS[|y-'-(3-j)|] > 1.5-e(2j+l), 

which, by Equation |9] implies that E[\y^ — (1 — j)| + \y^ — (3 — j)|] > 3 — 2e. It follows that the profile 
x^ satisfies the conditions of the lemma, and the proof follows. ■ 

With this lemma, we are ready to prove the theorem. Let x be a location profile that satisfies the conditions 
of Lemma |25l and assume w.l.o.g. that x is a profile in which half the agents are located at and half at 4. 
By the last lemma, it holds that 

^[|y-l| + |y-3|] >3-2e, (10) 

where y ~ /(x). For ease of presentation, let p = Pr{\y — 2| < 1) and let z = E[\y — 2| : |y — 2| > 1]. 
It holds that 

E[\y - 1\ + \y - 3\] = E[\y - 1\ + \y - 3\ : |y - 2| > l](l-p) 
+ E[\y-l\ + \y-3\ : \y - 2\ < l]p 
= 2z{l -p) + 2p. 

Therefore, by Equation (ITOl ). it follows that 2z(l — p) + 2p > 3 — 2e. Since z > 1 by definition, it follows 
that 

1 - 2e 

P<1 . (11) 

^ 2z-2 

We now turn to calculate the SOS cost of the profile x induced by the mechanism /(x). It holds that 

E[sc{y,x)]=E[sc{y,x) : \y - 2\ < l]p + E[sc{y,x) : \y - 2\ > - p) 

>sc{E[y:\y-2\<l],x)p + E[sc{y,x) : |y - 2| > 1](1 - p), (12) 

where the last inequality follows from Jensen's inequality. Since 2 is the optimal location in the profile x, 
it holds that 

sc{E[y : |y- 2| < l],x) > sc(2,x) = 4n. (13) 



We also have that 



E[sc{y, x) : |y - 2| > 1] = E[^iy^ + (y - 4)2) : |y - 2| > 1] 

= -{8 + 2E[{y-2f :\y-2\>l]) 
> !^{8 + 2E^[\y-2\ :\y-2\> 1]) 
= ^(8 + 2^2), (14) 

where the last inequality follows from Jensen's inequality. By Substituting ([T3] ) and ([T4l ) in ([T2l ). we get 
that E[sc{y, x)] > §(8 + 2^^ - 2z'^p). It, therefore, follows by ([IB that 

i?[sc(y,x)] > -(8 + 2z2 - 2^2(1 _ __)) = 4n + ^j^y^- (15) 
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It is easy to verify that this function attains its minimum at z = 2, with a value of 6n — 4ne. Thus, 
E[sc{y, x)] > 6n — 4ne. The optimal solution is to locate the facility at 2, which yields an SOS cost of 4n. 
We get that ^^^Qp^^^^ > ^"J^"*^ = 1.5 — e, and a contradiction is reached. The assertion of the theorem 
follows. □ ■ 



D Missing proofs in Section |5] 

Proof, of Theorem [T9l It has been shown in earlier papers that a median of a tree is SP (see, e.g., [?]). 
Here we show that it provides 2-approximation with respect to the miniSOS objective. 

We start with some notation. Given a location profile x, let n and Opt denote the median and the optimal 
location, respectively. Let T^?''* € T{G, /u) denote the subtree of T{G, fi) that contains Opt, and let 
L{G, x) C iV denote the subset of agents i such that Xi T^^* \ {fi}. Similariy, let T^^^ G T(G, Opt) 
denote the subtree of T{G, Opt) that contains fi, and let R{G, x) C denote the subset of agents i such 
that x-i Tppt \ {Opt}. Finally, let M(G,x) C N denote the subset of agents that belong to neither L 
nor R. These are the agents i, such that x-i ^ fi and Xi ^ Opt, for which path{xi, fj.) Ci path{xi, Opt) is 
one location on the graph, rather than a path. When clear in the context, we simplify notation and write 
L,R,M. 

With this, we continue. Suppose /U 7^ Opt (if fi = Opt, we are done). Let k be the number of agents in 
M{G, x), and refer to these agents as xi, . . . , (order them arbitrarily). In what follows we describe an 
iterative process, which starts with a tuple {G^ = G, x*^ = x) of graph and location profile, and in every 
iteration j G [A;], a new tuple (G-', x-') is induced in a specific manner that will be specified soon. We will 
show that the approximation ratio in every such iteration can only get worse. The assertion of the theorem 
will then be established by proving the desired approximation ratio on the final instance. 

For every j € [k], let OpP and fi^ denote the respective optimal location and median with respect to 
(G-', x-'). For every j G [k], define (G-', x-^) as follows. Starting with G^~^, add an edge, rooted at 
OpP^^, of length da {xj, Opt{G, x)), and move Xj from its position in G^^^ to the end of this edge (see 
Figure [U for illustration). We will also use the notation xl to denote the location of Xi in G^ for every 

ie[n],j e [k]. 

It is easy to see that fi^ = /x-'"^ for every j (since agents never cross the median in their new location). We 
shall, therefore, use fj, to denote the median of every iteration. 

It is slightly more subtle to see that the optimal point does not change either; i.e., OpP = OpP~^ for 
every j G [k] . 

Lemma 26 For every j G [k], it holds that OpP~^ = OpP. 

Proof. Let denote the subtree of T{G^~^,OpP~^) that contains /i. Note that in G^~^, Xj G 
as well. Let denote the subtree of T{G^ ,OpP~^) that contains /i, and let Tj denote the subtree of 

T{G'J, Opp-^) that contains xj. Note that \T{G^ ,Opp-'^)\ = \T{G^-^ ,Opp-'^)\ + 1. 

By applying Lemma[3]with w as the uniform distribution, for every T G T{G^^^, OpP^^), it holds that 

d{x{-\Opp-^) < d{x{-\Opp-^). 

je[n]:2,'^"^gT i£[n]:xl~^^T 

It is not too difficult to see that this implies that for every T' G T{G^, OpP^^) such that T' / T'^, Tj, it 
holds that 

Y d{x{,Opp-^)< Yj d{x{,Opp-^). 

ie[n]:xl€T' i&[n]:xi^T' 
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In addition, due to the fact that we split into T'^ and Tj, the same holds for them. It follows from 
Lemma |3] that OpP - ^ = OpP . ■ 



Since this iterative process changed neither the optimal location, nor the distance of the agents from the 
optimal location, the optimal social cost did not change either. That is, the optimal social cost in (G, x) 
is the same as the optimal social cost in {G^,x.^). In contrast to the optimal cost, it is not difficult to see 
that the mechanism's cost increased. This is because the distance between and the agents in M{G, x) 
increased (while it did not change for the other agents). It follows that it is sufficient to prove the desired 
approximation ratio on the instance (G'^, x'^). 

In order to establish a 2-approximation on (G'^,x^), we modify the instance (G'^,x'^') to a line instance 
(M, z) (for z G M"), such that, the distances of every agent from the median and from the optimal location 
are preserved. 

Let L'' = L{G'',x''),R'' = /^(G^x'=) and M'' = M(G'=,x'=). By construction, M'' = 0. Consider the 
following location profile, {zi, . . . , Zn), on a line. For every i G L'^, let Zi be located at —dQk {x^, fi^). For 
every i € i?*"', let Zi be located at dQk{^^^x^) (which is equal to dQk [ft'' , Opt'') + dQk {x'^ , Opt'')). Let 
b = dQk {fi'', Opt''). For every i € [n] it holds that d{zi,0) = dQk {x'^,n''), and d{zi,b) = dQk (xf , Opt''); 
i.e., the respective distances from /i'^ and Opt'' were preserved. 

Let /Uz and Optz, denote the median and the optimal location with respect to (M, z), respectively. It is easy 
to verify that < and Optz > b. By Theorem [l4j locating the facility at fiz provides 2-approximation 
for this line instance. But since < < fe < Opt^, it follows that locating the facility at yields a social 
cost (SOS) that is at most twice that of b; i.e., 

ie[n] iG[ra] 

However, because the respective distances from fi'' and Opt'' were preserved, it follows that 

dG.(xf,/)2 < 2 ^ dQkixlOpt'')^ 

iG[n] ie[n] 

as required; the assertion follows. ■ 

Proof, of Proposition |20l It is easy to verify that an agent can modify the location of the mechanism's 
outcome only by declaring herself to be on the location's opposite side, thus pushing the returned facility 
location away from its true location. It follows directly that the distance between the original location and 
the new one is exactly the additional cost imposed on a misreporting agent. ■ 

Proof, of Lemma |2T] For every i £ [n], let yi = /i(x). By design, every subtree T € T{G, yj) (excluding 
yj) contains less than qn agents. Assume by way of contradiction that the points yi, . . . , y„ are not located 
on a single path. Let Ty be the subtree induced by connecting all the y^'s (i.e., for any a G Ty, there exist 
i,j G [n] such that a G path{yi,yj). By the contradiction assumption, Ty contains at least three leaves; 
assume w.l.o.g. these leaves are yi,y2,y3- Then, for every j G {1, 2, 3}, there exists a subtree in T{G, yj), 
call it T^, that contains Ty. By design, T^ \ {yj } contains less than qn agents; therefore, more than (1 — q)n 
agents are located at {T{G, yj) \T^}U {yj}. It is easy to verify that {T{G, yi) \ T^} U C \ 
and also {T{G, 7/2) \ T'^} U {2/2} C T^ \ 7/3. We get that \ 2/3 contains more than (2 - 2q)n > |n 
agents, where the last inequality follows by ^ < q < |. This contradicts the choice of 2/3 by /3(x); the 
assertion follows. ■ 

Proof, of Lemma[23l Assume that the optimal location is not at yi (otherwise, the approximation ratio is 
1 and we are done). By design, if yi = 7/2 = • • • = Un, then there does not exist T G T{G, yi) such that 
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there are more than n/3 agents in T \ {yi}. Let Tppt G T{G, yi) such that the optimal location is in Topt- 
Let C be a new tree which is created by positioning a new subtree T' which is constructed by a long edge, 
rooted at yi, and let x' be a new location profile, such that for any Xj G Topt \ {yi}, x[ is located on T', at 
the location that is distanced d{yi,Xi) from yi. Let the optimal location with respect to the new ti^ee and 
x' be denoted as Opt'. Let 6 denote d{Opt', yi). It is straightforward that because Opt is in Topt, then 
Opt' is in T'. Let Tr € T(G', Opt') such that yi G Tr and let Tl be the other subtree. 

Following a simple variation of Corollary |7l it holds that 

scG'iyi-,^) = scG'{Opt' ,x') + n(5^. (16) 

By Lemma|3]it follows that YIx'-gTl ^^^'v Opt') = '^Zx'.&Tr d{x'^,Opt'). Since the right expression is at 
least |n5, so is the left one. 

We shall next examine the social cost of Opt' with respect to G' and x'. at least |n agents in Tr add 
at least 6^ each to the social cost obtained at Opt'. On the other hand, following Jensen's inequality, the 

minimal addition agents in Tl yield to the social cost is obtained when they are located at distance of 

T,x'eT <i{x'.,Opt') 

— * — jjr-^ from Opt'. Thus, their addition is at least 



'T.xr&Ti,d{x'i,Opt' 
\Tl\ 




As this expression decreases in \Tl\, and noting that \Tl\ < ^, it follows that their addition is at least 
|n(5^. Summing it all together, it follows that the see {Opt', x') > 2n(5^. Applying (fT6l) . it follows that 

scG'(yi,x') 



sec (Opt', x') 



Now, when considering G and x, we see that the mechanism's induced social cost does not change (as the 
distances from yi do not change), while we added constraints that might affect the optimal location. The 
assertion follows. ■ 



We shall now present the formal statements of the lemmata that are required in the proof of Theorem |22] 
Since yi and yn are symmetric, we state and prove every lemma for one case; the complementary one can 
be proved analogously. 

Lemma 27 Let i be an agent such that yi is in the open path{yi , yn), and assume that Opt G path{yi , yn)- 
Let {G' ,ii!) be a transformation of {G,'x) as follows. Ifd{xi,yi) < d{yi,yi), then position i on path{yi, yi), 
distanced d{xi,yi) from yi. Ifd{xi, yi) > d{yi,yi), then create an edge, rooted at yi, of length d{xi,yi) — 
d{yi,yi), and locate i at its tip. Let (G" , x") be a transformation of (G, x) as follows. Create an edge, 
rooted at Opt, of length d{xi,Opt), and place i at its tip. Then, the mechanism obtains a worse approxi- 
mation ratio for at least one of the tuples {G' , x') and {G" , x"), compared to (G, x). 

Proof. We start by defining a parameterized instance, namely [G^ ,yi^), as follows. For any < 5 < 
d{y'-,Opt), originating at (G',x'), create an edge rooted at the location that is distanced 8 from y- on 
path{y'i, Opt), of length d{x'i,y'i)+5, and locate i at its tip. We note that x^) and [G'^ivl^Opt) ^ ^d{yO,Opt)^ 
are in fact (C, x') and {G" , x"), respectively. 

Note that the optimal cost for any 6 does not change, as the distance between agent i and the original 
optimal location does not change, and the other agents are left untouched. Therefore, it is left to show that 
the social cost obtained by the mechanism is higher for either 5 = or for (5 = Opt) than the social 
cost obtained for the original instance (G, x). 

We shall use the following notations. Let Lj denote the agents that do not share the same subtree of yf 
as yn, i.e., let Tn G T{G^,y^^) such that y„ G T^; then, j G Li iff ^ r„. Given 5, let Mf be defined 
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as follows, j G iff j ^ i and yj € path{y^,yf). Let Rf be the complementary set of the agents 
excluding agent i, i.e., Rf = N-i\{Li U M/}. Let L(j„g(yO) denote the agents that do not share the same 
subtree of avg{y^) as y„, i.e., let T„ € r(G°, au5(y°)) such that y„ G T„, then j G L^vg{yO) iff ^ ^n- 
Similarly, we shall define Ravg{yO) = ]^\Lavg(yO)- 

Let (7 = min{d{yi,yi),d{xi, yi)). We note that G is in fact . Let represent the social cost obtained 
by the mechanism for the instance (C^, x*^). We shall next show how h{5) differs from /i(0), i.e., we shall 
characterize g{5) such that h{8) = h{0) + g{S). 

As we come to analyze g{S), we can divide the analysis into two components, namely the agent component 
and the weighted average component. 

We shall start with the agent component. If the projected location of an agent j G Rf is chosen, then 
the obtained social cost does not change, as the distance of Xi from yj does not change. If the projected 
location of an agent j G Li is chosen (i.e., yj), then the social cost increases (with respect to G° and x°) 
by 

4%,-,xO)<5 + 4<52. (17) 
If the projected location of an agent j G M/ is chosen, then the social cost increases by 

4d(y,-,xO)((5 - diyj,y^)) + 4(5 - diyj,y^)f. (18) 
If the projected location of agent i is chosen, then the social cost increases by 



^ {2d{y^,xj)5 + d') - {^diylxj)d-6') + 6^ (19) 

where the last summand is for i herself. Every agent's projected location is chosen with probability 
thereby deriving the agents component in f{S). 

We continue with the weighted average component. The influence obtained by the weighted average com- 
ponent (i.e., by avg{y)), consists of two parts; namely, the movement of o,vg{y) itself, and the influence 
of the movement of i on the social price obtained by avg{y). For the first part, since avg{y) smoothly 
slides towards yn by the distance of ^, the difference in cost originating by the first part is given by 

Y: (2dix%avgiy))^ + (^y]- ^ -^(^))^ " (^) ^ " ^^'^ 

For the second part, similarly to the classification done when characterizing the agents' component, we 
need to distinguish between three scenarios. Denote this influence by /3. 

If avg{y^) is not on the same subtree of y° as yn, i.e., let Tn G T{G'^,y'-) such that yn G T^, and 
avg{y^) i r„,then/3 = ^d{avg{y^),x\)M^'^ . \iavg{y^) G pat /i (yf, yf), then /3 = M{avg{y^),x\){8- 
•^(0^5(5^^), y^)) +4((5 — d{avg{y^)^y^^y)^ . If neither of the above holds, then it preserves its distance from 
Xi, i.e., ^ = 0. 
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In summary, when summing over the different components, we get: 



=h [ll {Hyj,x'^)d + 4d') 



+ 



1 

2n 



{4d{y„x^){6-d{y„y^)) + 4i6-d{y„y^)f) ) + 

\jeMf 



\jeLi je{M/uilf} / 



One can verify that g{5) is continuous, and is piecewise differentiable (in particular, it is not differentiable 
for any 6 such that there exists j € N\Li such that 6 = d{yf,y-j), and when 6 is such that avg{y^) = yf). 
Moreover, one can also verify that for any 6, the left-hand derivative is smaller or equal to the right-hand 
derivative of 3, i.e., for any (5, 5^ ((5) < g'^{S). In other words, 5 (5)' s derivative is monotonically increasing 
where it is defined (this can be easily seen, as the quadratic components are all positive). RecalHng that 
h{5) = h{0) + g{S), and considering that < cr < d{yf, Opt), it follows that either h{0) > h{a), or 
h{d{y^,6pt)) > h{a). The fact that (G'", x"^) is (G, x) concludes the proof. 



Lemma 28 Let Ti be the subtree of T{G,avg{y)) that contains yi and assume that Opt ^ T\. Let 
(G', x') be a transformation of (G, x) that, for every agent ifor which yi = yi, locates x\ at y\. Then, the 
approximation ratio obtained by the mechanism for (G',x') cannot be better than the ratio obtained for 
fG,x). 

Proof. Let 8 = d{xi,x'-). The difference between the social cost obtained by the mechanism for (G, x) 
and for (G', x') is given by 



J2 ^ {nxi,yj)S + l {2d{xi,avg{y))S - (J^) = 

- + ^2(d(xi,yi) +d(yi,yj))(5 + -2{d{xi,yi) + d{yi,avg{y)))5 = 

-6'^ + 2dixi,yi)S+J2 ^2d(yi,%)^ + lMyi,avg{y))d = 

- 52 + 2d{xi,yi)S + 2d{yi,avg{y))5, 

where the last equality follows from YljeN d-iyi, yj) = nd{yi,avg{y)). 

In contrast, it is easy to verify that the optimal cost decreases by at least 2d{xi, yi)S + 2d{yi, Opt)5 — S^. 
As d{yi,avg{y)) < d{yi,Opt), the assertion follows. ■ 



Lemma 29 Let Topt be the subtree in T{G,yn) that contains Opt. Let Nq = {i ^ N : yi = yn,Xi ^ 
Topt}' (^nd let a = X^j^jVo d{xi, yn)- Let (G', x') be a transformation of (G, x) that, for each i G Nq, 
creates an edge of length rooted at yn, and locates x\ at its tip. Then, the approximation ratio 
obtained by the mechanism for (G', x') cannot be better than the ratio obtained for (G, x). 



xi 



Proof. We first note that for each i £ Nq, y'- did not move. Therefore, the locations that have a positive 
probability of being chosen by the randomized mechanism do not change. 

Let 6i denote the difference in distances of agent i from y„, induced by the transition to (G',x'), i.e., 
6i = d{yn, x'j) — d{yn,Xi). Recall that x-) = thus it is easy to see that = 0- It follows 

that the increase of cost in {G', x') is as follows. 

E ( E ^ {2d{x^,y,)5^ + 6f) + \ {2d{x,,avg{y))6, + Sf) | + 

E ( E ^ {2d{xi,yj)5, + 5f) + \ {2d{x„avg{y))5, + 5f) 1 
{2d{xi,avg{y))6, + 6f) + ^ {2d{xi,avg{y))Si + 6f) 
{2d{x[,avg{y))6,-6^)+ ^ {2d{x'„avg{y))6i. - 6f) 

iGNo,6,>0 i£No,5i<0 
i£No 
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Where the last two equalities are due to d{xi, avg{y)) + 6 = d{x[,avg{y)) and to Y^{5i) = 0. We note 
that in the transition from (G, x) to (C, x'), Opt's location does not change (see LemmaO. Therefore, in 
the same manner, the optimal cost decreases by YlieN well. It follows directly that the approximation 
ratio cannot decrease in the transformation. ■ 



Lemma 30 Assume that Opt is located at yn, but Opt is not, and let Topt be the subtree in T{G, Opt) 
that contains Opt. Let Ni = {i N : Xi (z Topt}, cind let a = 'Ylii^Nx d{xi,Opt). Let (G',x') be a 
transformation of {G, x) that creates an edge of length rooted at Opt, and locates all of the agents in 
Ni at its tip. Then, the approximation ratio obtained by the mechanism for {G' , x') cannot be better than 
the ratio obtained for (G, x). 

Proof. We shall prove the lemma in two steps. In particular, the instance (G',x') shall be induced by 
(G, x) through an intermediate instance (G", x"), as follows. Let j G A'^i denote the agent whose location 
is the farthest away from Opt, i.e., j = argmaxi^N^d{xi, Opt), and for each i £ Ni, let x'- be located 
on path{xj,Opt), distanced d{xi,Opt) from Opt. It is easy to verify that the cost of the randomized 
mechanism does not change in the transformation, while when examining the optimal location, clearly its 
induced cost decreases, as agents in A'^i could only move closer to it. It follows that the approximation 
ratio increased by the transformation. Next, we make the transformation from (G",x") to (G',x'), and 
show that the approximation ratio keeps getting worse. 

Using the notation of the proof of Lemma [29l let 6i denote the difference of distances of agent i from Opt, 
induced by the transformation from (G",x") to (G',x'). Following the steps of the proof of Lemma [29] 
it follows that the mechanism's induced cost is reduced by J2ieNi ^i- ^^^^ show that the optimal cost 
decreases by the same amount. 

It is clear that Opt" divides G" to exactly two subtrees. Let Ti,T2 G T{G", Opt") be the two subtrees 
induced by Opt". By Lemma[3]it follows that J2ieTi ^i^'I^ Opt") = J2ieT2 '^(^f' Opt"). After averaging, 
it is easy to verify that J2i£Ti d{x^,Opt") = '}2i£T2 "^(^i' Opt"), and thus by Lemma|3]it follows that the 
optimal location does not change. Therefore, the same calculation from Lemma [29] holds for the optimal 
location, and thus the optimal cost decreases by YlieNi well. The assertion of the lemma follows. ■ 



Corollary 31 Let p be some location on the open interval path[yi , yn). Let N2 = {i £ N : yi = p}, and 
let a = X^j£^2 d{xi,p). Let (G',x') be a transformation of {G,:x.) that creates an edge of length 
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rooted at p, and locates all of the agents from N2 at its tip. Then, the approximation ratio obtained by the 
mechanism for (C, x') cannot be better than the ratio obtained for (G, x). 

Proof. By observing that the process does not change the locations of yi,y2, ■■■ ,yn, the proof is identical 
to the one of Lemma [30l ■ 



Lemma 32 Assume that for each i such that yi ^ yn, it holds that yi = xi. Let = {i € N : yi = 
yn, Xi 7^ y„}, and assume that for any j, k E it holds that d{xj,yn) = d{xk,yn) cind that Xj and Xk 
are not located on the same subtree induced by T{G, yn)- Then, the approximation ratio obtained by the 
mechanism is at most 1^. 

Proof. Let 7 = d{avg{y),yn), and let 6 denote the distance between agents in and y„. Note that Opt 
can be located only on path{yi , yn)- We consider the following two cases: the case in which opt is located 
at yn, and the case in which it is located somewhere else on path{yi,yn)- We start with the latter case. 

Due to LemmaO it follows that X^j^iVg d{xi,yn) > J^ieNs ^(^i' Vn)- Since 

^ d{xi,yn) = ^ d{yi,yn) = ^d{yi,yn) = \N\d{avg{y),yn), 

i^Ns i^N3 ieN 

and YlieNs d{xi,yn) = \N3\6, it follows that 

|iV|7 > \Ns\6. (21) 



Let x' be derived from x, such that for any i € -/V3, x- = y„. We note that all of the agents in x' 
are located on path{yi,yn), and that the mechanism described in Section l4!2l is identical to Mechanism 
randomized DGM with q = 2/3, when activated on x'. Thus, due to Theorem [TT] it follows that the 

approximation ratio is at most 1.5, i.e, ge*'(Opf'*'x')^ — "^^^^ ^^^^ ^ ' ^(y) and Opt are located 

at the same location. When making the transformation to x, the mechanism's possible locations do not 
change, and it is easy to see that the mechanism's induced cost increases by \N?j\{2'^5 + 5'^)- The change of 
the optimal cost can be calculated by checking how the original optimal location's induced cost increases 
by the transformation, and how it decreases by smoothly moving the location to the new optimal location, 
i.e., by calculating {sc{Opt', x) — sc{Opt', x')) — {sc{Opt', x) — sc{Opt, x)). Thus, using Lemma[6l the 
optimal cost increases by 

|iV3|(275 + 5')-^^. (22) 
Therefore, the approximation ratio of G and x is as follow. 

sc(/(x'),x') + |iV3|(27<5 + '^') 



sc{Opt',^') + \N,\i2j5 + 6^)-\^' 



(23) 



It follows that it is sufficient to prove that \N3\{2'y5+5 ) ^ ^ ^ simple rearrangement, this is 

|jV3|(27(5+<5^)- ' 

equivalent to proving that 3^^^ < 27 + 5. Using (|2T]) . it follows that 

\N3\5 \N3\6 , 

and the assertion follows for the case in which Opt is located on path{yi,yn), excluding y„. 

We now turn to prove the lemma for the case in which Opt is located at y„. We follow the steps of the proof 
of the first case, only this time the optimal location moves from avg{y) to y„, and thus in the transition. 
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sufficient to prove that 



the optimal cost increases by \N3\{2'y6 + 6"^) — \N\ (2-1^^7 — 7^ ). Therefore, in the same manner, it is 

< 1.5, (24) 



\Ns\{2^6 + 6^)--\N\ (2^7-72) 
which is equivalent to 



3|A^| (2^7-7^ ) < |iV3|(275 + ,52). (25) 



Clearly, \Ns\ < |[iV| < ||A^|. It follows that 



3|iV| (^2^^^7 - 72J = 6|iV3|(57 - 3\N\-f^ < 6\N3\6-f - 4|A^3|72. (26) 

Using (|26l ) in (|25] ). it is sufficient to show that 

6\N3\6j - i\N3\-f^ < \N3\{2j6 + 5^), 
which is equivalent to showing that (27 — 6)^ > 0. The assertion follows. ■ 



Lemma 33 Let G and ii.be a graph and a location profile as follows. Q C N agents are located at yi; 
P C N agents are equally distanced from y„, each on a different subtree ofT{G, yn), such that for each 
agent i £ P it holds that yi = yn,' M C N agents are located at the same location, distanced 6 from 
path{yi , yn), and the rest of the agents ( i.e., agents in N\{P U Q U M} ) are scattered along path{yi , yn)- 
Then, the approximation ratio obtained by the mechanism is at most 1.83. 

Proof. We start with a few preparations. First of all, we normalize ci(yi, y„) to be 1, and rescale the graph 
accordingly. Given i G M let yM = Hi- Let /? denote the distance between yi and yM, and assume 
d{yi,yM) < d{yM,yn) (the proof works analogously for d{yi,yM) > d{yM,yn))- Let x' be defined as 
follows, for each i G M, let x[ = yi, for the other agents, let x- = Xj. 

Due to Lemma [32I it holds that 

sc(/(x'),x') 1 

sc{Opt',^') -2' ^ ' 

Let Aopt = sc(Opt,x) - sc{Opt',x.') and Auan = sc(/(x),x) - sc(/(x'), x'). 
We start with providing an upper bound for Aran- 

We first observe that \Q\ should contain the minimal number of agents in order to maximize the costs' 
difference. Indeed, each agent that could be relocated at P, keeping yi fixed (meaning, keeping |Q| > 
|A^|/3), would increase the costs' difference. The same holds for any agent that is located anywhere on 
the open path{y I, yn)- 

We next observe that the distance between yM and avg{y) cannot be higher than | — ^j^, due to the follow- 
ing. Assuming that avg{y) is on path{yM, Vn) (the proof works identically for avg{y) on path{yi , t/m)), 
let x*^ be constructed such that x*^ differs from x such that for each i £ M, = y„. Its easy to verify 

that d(avg{y^),yM) — ^-^d{yn, yu) = d{avg{y), yM), as in the transition of each agent in |M| from x*^ 
to X, the weighted optimal location transitions by j^d{yn, yu)- Therefore, the distance between yM and 
avg{y) is bounded as follows: 



d{yM,avg{y)) = d{avg{y°),yM)-j^d{yn,yM) = d{yn,yM) (^1 - j^^-d{yn,avg{y^)) < 



\M\ \M\\ . n.. .2 \M\ 

3~^' 
(28) 

where the last equality is due to the fact that because there are at least ^ agents that are projected on each 
side of path{yi,yn), d{yn, avg{y^)) must be bigger than i. 
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We can now bound the difference of costs between sc(/(x), x) and sc(/(x'), x'), by the following expres- 
sion (an explanation follows): 



IMI 



\M\ 
W\ 



6 + 



(29) 



Each summand represents the expected difference of costs induced by each location with a positive prob- 
ability. The first summand stands for agents in M (the difference of costs when choosing agent in M is 
6"^, and the probability is ^tot); the second summand stands for agents in Q, and as shown earlier, in order 



to bound the difference, we assume that | Q | = | ; the next summand stands for P, and the probability of 
choosing an agent in P is calculated given \M\; and avg{y) is the last summand. Naturally, the whole 
expression is multiplied by |M|, as the expected difference of costs is multiplied by the number of agents 
that make the transition. 



By rearranging expression 



, we get 

5^ + f3 



\M 



\M\6 

T/vT 



3 +3^ 



2\M\6 
\N\ 



(30) 



Since ^ < i, it holds that (3 



\M\S 
\N\ 



< 0. It, therefore, follows that 



ARan < \M\ 



6^ + -d 



2\M\S' 



(31) 



Therefore, we conclude the following. 



sc(/(x) 



sc{Opt, x) 



< 



sc(/(x'),x') + |M| 



2\M\S 
\N\ 



sc{Opt', x') + A 



Opt 



(32) 



We next distinguish between two cases: one where Opt is on path{yi,yn), and the second in which it is 
not. We start with the former case. In this case, it is easy to verify that Aopt > |M|(5^. In addition, due to 

c(/(x'),x')+l||Af|52 



Equation (1271) . it holds that 



sc(Opt',x') + |A/|<5^ 

gc(/(x),x) . 1 
sc{Opt, x) 



< li. Together with it follows that 



IMI 



< 1- + 
-2 



2^3" 



2\M\S 

\N\ 



sc{Opt',x') + \M\6^ 



(33) 



Next, we point out that sc{Opt', x') > This is due to the fact that each of Q and P must contain at 
least |A^|/3 agents each, and thus even if all the remaining agents would be located on Opt', and P and Q 
would be located as close to Opt as possible, the cost could not be lower than Therefore, we conclude 
that 



sc(/(x) 



IMI 



sc{Opt, x) 



< 1- + 
-2 



2^3" 



2\M\8 



^ + |M|52 



(34) 



A numeric analysis would find that the maximal value obtained for the right-hand side of ( [34b is 1.82085 . . . 
(obtained by fixing 6 = 0.54144 ... and |M| = ||A^|). 

It is left to analyze the latter case, namely the case in which Opt is not on path{yi,yn)- Due to Corollary |7l 
when moving the facility location from Opt to Opt, its induced cost is reduced by \N\d{Opt, Opt)"^. Let 
Ti,T2 € T(G, Opt), such that the agents from Af are in Ti. Due to Lemma |3l it holds that ^^^^^^ d{xi,Opt) 
gT2 d{xi,Opt). By substituting the left-hand side and the right-hand side, we get 



\M\{5 - d{0pt,0pt)) 



\M\)d{0pt,0pt)+ d{xi,Opt). 

Xi£T2 



(35) 



XV 



Since 'Ylx,(^T2 ^i^i, Opt) > and by rearranging we get tliat 



IMI 1 

l^S-->d{0pt,0pt). 



(36) 



Now we can bound Appt as follows. 

^Opt = [sc{Opt,x.) — sc{Opt' ,x')] — [sc{Opt,x) — sc{Opt,x)] 

> [sc{6pt,x.) - sc{6pt,x')] - [sc{6pt,x) - sc{Opt,x)] > \M\5^ 

Following the steps of the former case, it follows that 



iV 3 



gc(/(x),x) ^ 
sc(Opt,x) - 2 



IMI 



2^3" 



2\M\S 



M + |M|^2_|^r| (Ms -I 



(37) 



A numeric analysis would find that the maximal value obtained for the right-hand side of (l37l ) is 1.611 .. . 
(obtained by fixing 6 = 1.0446 . . . and |M| = 0.32|A^|). The assertion of the lemma follows. ■ 



xvi 



